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Abstract. The de Bruijn graph B(d,3) consists of all strings of 
length 3, taken from an alphabet of size d, with edges between 
words which are distinct substrings of a word of length 4. We give 
an inductive characterization of the maximum independent sets 
of the de Bruijn graphs B(d,3) and for the de Bruijn graph with 
loops removed, for all d. We derive a recurrence relation for their 
number. 



1. Introduction 

For any positive integers d and D, the de Bruijn graph B(d, D) is 
the directed graph whose dP nodes consist of all the .D-digit words 
from the alphabet {0, . . . , d — 1}. There is an edge from a word x = 
x\ . . . %d to y = yi . . . jjd if and only if x?, . . . xd = yi ■ ■ ■ Vd-i- It is 
worth mentioning that B(d, D + 1) is the line graph of B(d, D). These 
graphs were introduced in the paper "A Combinatorial Problem" by 
N. G. de Bruijn, under the name of T-nets [5]. Since then, de Bruijn 
graphs have been used in a wide range of contexts. For example, in the 
study of normal numbers [151 H]> network topology [2j [11], quantum 
computation [13], and sequence assembly [12]. 

The graph B(d,D) contains d nodes of the form xx...xx, which 
have an edge to themselves. In a slight abuse of notation, we will refer 
to such a node as the loop x. Notice that a loop cannot be in any 
independent set of B(d,D). We study the structure of the maximum 
independent sets of B(d, 3). Since they cannot have loops, we call them 
loop-less maximum independent sets (LMISs) of B(d, 3). The loop-less 
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maximum independet sets of B(d, 3) are percisely the comma- free codes 
of length 3, introduced in [4] in an attempt to explain the genetic code, 
and later generalized in [HJ, [HI EI - Following the literature, we also study 
the maximum independent sets of the graph B(d, 3) with the edges 

removed. We call them maximum independent 
sets (MISs) of B(d, 3). As we will see, they all contain loops. Following 
[TO] , we write a(d,D) for the size of an MIS with loops and a*(d,D) 
for the size of a loop-less MIS. 



OOP 




Figure 1. The nodes of an MIS of -8(3,3) are high- 
lighted. Bold arrows indicate edges under the shift func- 
tion 9. The loops on 000, 111, and 222 are not shown. 

The maximum independent sets of de Bruijn graphs have been previ- 
ously studied ([9] and [TO] , for example). From [10], one can show that 
for d > 4, a(d, 2) = a*(d, 2) = |_cZ 2 /4j and the number of maximum 
independent sets of B(d, 2) is ( d ^ 2 ) if d is even and 2( ( . d _ 1 ^ 2 ) if d is odd. 
In the present work, we study the MISs of the graphs B(d, 3). Again, 
in [10], it was proved that 

(1) a(d, 3) = — h 1 and a (d, 3) = — - — . 

We define four functions which, together with the action of the sym- 
metric group Sd recursively generate all MISs in B(d, 3). From this, we 
deduce a recurrence relation for the number of MISs in B(d,3). 

From Theorem 14.51 in Section HJ our main result, we derive the fol- 
lowing: 
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Theorem: If we let ad be the number of maximum independent sets 
of B(d, 3), then has exponential generating function 



We obtained the initial data for this work by computing the first few 
terms of ad using the free computer algebra system CbCcA (|3J). We 
have computed all the MISs for d < 5 by an exhaustive procedure, and 
we used those sets in some of our proofs. 

The rest of this paper is organized as follows. In Section [2], we define 
two functions / and /' that take a maximum independent set of B(d, 3) 
and return maximum independent sets of B(d + 1,3). We construct 
another two functions g and g' that take a maximum independent set 
of B(d, 3) and return a maximum independent set of B(d + 2, 3). We 
also prove some basic facts about the images of these four functions. 

In Section[3]we present the interactions between the symmetric group 
and the functions defined in Section [2j We compute the stabilizers of 
the images of our four functions. We later prove that all four functions 
map orbits under the action of the symmetric group into orbits under 
that same action. Furthermore, we show that the images of all four 
functions extended to orbits are disjoint. These results are used to 
prove the main theorems of this paper in Section HJ 

In Section we show the structure of the loop-less maximum inde- 
pendent sets of B(d, 3), and show that their number coincides with the 
number of maximum independent sets of B(d, 3) with loops. 

2. Inductive Construction of Maximum Independent Sets 

In this section, we present four combinatorial operations that trans- 
form a maximum independent set in the de Bruijn graph B(d, 3) into 
a maximum independent set in either B(d + 1, 3) or B(d + 2, 3). 

Essential to the structure of the de Bruijn graph B(d, 3) are the loops 
and the ^-cycles. Lichiardopol defined a shift function 9 on the nodes 
of B(d,3) by 9{xyz) = yzx. If xyz is not a loop, then xyz, 9 (xyz), 
and 9 2 (xyz) form a directed 3-cycle. In an abuse of notation, we will 
sometimes use 9 to mark the edges of this 3-cycle. The action of 9 in- 
duces a decomposition of B(d, 3) into (d 3 — d)/3 cycles of length 3, and 
d cycles of length 1 (i.e. the loops.) Each of these disjoint cycles con- 
tributes at most one node to any independent set of B(d, 3). Whenever 
we speak of "cycles", we mean the cycles induced by 9. In Figured! 
the ^-cycles are indicated by darker cycles. 




d=i 
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Convention 2.1. Throughout this paper, we let [d] stand for the set 
{0,...,d-l}. 

Definition 2.2. Let A be a set of nodes from B(d, 3). Let x and y be 
two digits in [d]. We say that y appears between x in A if the node 
xyx G A. We define J\l x (A) as the set of digits which do not appear 
between x in A. We define m x (A) as the number of digits which do 
not appear between x in A, i.e. m x (A) = \M X (A)\. 

Proposition 2.3. A maximum independent set S of B(d,3) contains 
either one or two loops. S always contains one loop a such that m a (S) = 
0. If S contains another loop b, then m^S) = 1 and M.b(S) = {a}. In 
particular, S contains aba, but not bab. 

Proof. Suppose that an independent set S contains two loops a and b. 
Consider the two cycles aab — > aba — > baa and bba — > bab — > abb. These 
two cycles can contribute at most the nodes aba and bab, since aab, baa, 
bba and abb are adjacent to either aaa or bbb. However, aba and bab 
are adjacent to each other, so only one of them can be present in S. 
Therefore, for every pair of loops present in an independent set S, one 
of the (d 3 — d)/3 cycles contributes no nodes to S. 

If I is the number of loops in S, there are ( 2 ) pairs of loops, and 
thus (g) cycles which cannot contribute any nodes to S. Therefore, 

\S\ < (d 3 — d)/3 + I — (2), so for I > 2, S cannot be a maximum 
independent set. 

The rest of the proposition follows from the previous observations: 
S must have one loop, by cardinality. If S is a maximum independent 
set with only one loop a, then every cycle contributes one node. The 

6 6 

cycles aax axa — > xaa can only contribute axa. If S has two loops 
a and b, then only one of aba and bab can be present. □ 

Remark 2.4. The previous proof illustrates a tool we use throughout 
this paper. When dealing with an MIS S, we often choose a cycle 

xyz — > yzx — > zxy and show that two of its nodes cannot be in S, 
therefore concluding that the third one is in S. Sometimes, we show 
that none of the three are in S, thus contradicting the fact that S is 
maximum. 

Convention 2.5. From now on, we state things like "Let S be a max- 
imum independent set with loops a and possibly b," when in fact S 
may have just one loop. In that case, everything said about b should 
be disregarded. 

Definition 2.6. If w is a node, we will denote by w[x ^ y] the node 
that results from replacing every occurrence of the digit x by the digit 
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y in w. We write x G w to mean that x is one of the digits that appear 
in w. 

We denote by a the loop of S such that m a (S) = 0. If S has another 
loop we denote it b. 

Definition 2.7. Let S be a maximum independent set of B(d,3). We 
define f(S) C B(d + 1, 3) to be the union of S with the sets 



EMS) 


= {w[a - 


-> d] w G 5, a G w,w ^ aaa, w ^ aba}, 


£/ 2 (s) 


= {axo? 


x G [d]\{a,&}}, 


E/ 3 (S) 


= {cfo;a 


x G [d]\{a,6}}, 


£/ 4 (s) 


= {udv 


{a, 6}}, 


£/ 5 (s) 


= {udd 


■u G {a, 6}}. 



Remark 2.8. The loops of 5 are the same as the loops of f(S). 

Proposition 2.9. If S is a maximum independent set of B(d, 3), then 
f(S) is a maximum independent set of B(d+ 1,3). 

Proof. f(S) is made up of six disjoint sets. Let / be the number of 
loops of S. We have 

| ^(5-) | = (d-l)(d-l)-(l-l) + (d-l) = d 2 -d + 2-2l 

\U 2 (S)\ = \U 3 (S)\ = (d-l), 

\U A (S)\=l 2 , 

\U 5 (S)\=l. 

Only the cardinality of U\(S) requires explanation. Notice that B(d, 3) 
has (d — l)(d — 1) cycles which contain a once, with the exception of 



abb — > bba — > bob in the case that / = 2. Each of these contributes one 
element to S and thus to U 1 (S). In addition, S contains one element 

from each of the d — I cycles of the form aax —> axa —> xaa, where x 
is not a loop. 

We add the six quantities to obtain 

|/(g)|= (d+1) 3_ ( , + 1) +(j _ i)(i _ 2) + i 

Since / is either 1 or 2, f(S) has the size of an MIS. 

We still have to prove that f(S) is an independent set. This amounts 
to noticing that there are no arrows between the six sets defining f(S). 
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The only remark to bear in mind is that axa is in S for all x, and 
that bxb is also in S, except for x — a. We leave the details to the 
reader. □ 



We define another function very similar to /. 

Definition 2.10. Let S be a maximum independent set of B(d, 3). We 
define f'(S) C B(d + 1,3) as the union of S, the sets Ui(S), U 2 (S), 
U 3 (S), U 4 {S) from Definition [23 and 

U^(S) = {ddu | u G {a, b}}. 

Proposition 2.11. If S is a maximum independent set of B(d, 3), then 
f'(S) is a maximum independent set of B(d+ 1,3). 

Proof. This proposition is proved analogously to Proposition 12.91 □ 

Definition 2.12. Let S be a maximum independent set of B(d, 3). We 
define g(S) C B(d + 2, 3) to be the union of S with the sets 

Vi(S) = {w[a — > y] | y G {d,d + l},w G S,a G w,w ^ aaa, w ^ aba}, 

V 2 (S) = {axy | x G [d]\{a, b}, y G {d,d+ 1}}, 

Vs(S) = {yxa \ x G [d]\{a, b}, ye {d,d+ 1}}, 

Vi(S) = {yxz | y, z G {d, d + 1}, y z, x G [d]\{a, 6}}, 

^(5) = {uyv \u,v e {a, b}, y G {d,d+ 1}}, 

^(S 1 ) = {uyy | m e {a, 6}, y G {cf, d + 1}}, 

V 7 (S) = {yzu \y,z G {d,d+l},y ^ z,u G {a, 6}}, 

V8(5) = + l)(d + 1), (d + l)dd}. 

Proposition 2.13. If S is a maximum independent set ofB(d,3), then 
g(S) is a maximum independent set of B(d + 2, 3). 

Proof. g(S) is made up of nine disjoint sets. For each pair of sets, it is 
clear that there are no edges between them. We now show that g(S) 
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has the right size. If I is the number of loops of S, 



\3\ 


^3 A 

a — a 
= 3 + 1 


■ 




WAS)] 

1 ' J. V, / 1 


= 2\U 1 (3)\ = 


2{d 2 - 


-d + 2 


\V*(S)\ 


= 2\U 2 (S)\ = 


2{d- 


I), 


\Vs(S)\ 


= 2\U 3 {S)\ = 


2{d- 


I), 


\V A (S)\ 


= 2\U 4 {S)\ = 


2l\ 




\Vs(S)\ 


= 2(d-l), 






\v e {s)\ 


= 2\U 5 {S)\ = 


21, 




\V 7 (S)\ 


= 21, 






\V 8 (S)\ 


= 2. 







21), 



The sum of these sizes is 

{d + 2) 3 - {d + 2) 



\g(S)\ 



+ 2(/ -!)(/- 2) + 1. 



For / = 1 or 2, g{S) is a maximum independent set of B(d + 2, 3). □ 

Definition 2.14. Let S be a maximum independent set of B{d, 3). We 
define g'{S) C B{d + 2, 3) to be the union of S, the sets V\{S), V^S"), 
V 3 {S), V 4 {S), V 5 {S) from Definition EH and the sets 

K( s ) = {yy u i u e i a ,b}^y e {d,d+ 1», 

Vj{S) = {uyz, y,z G {d,d+l},y ^ z,u G {a, 6}}, 

V£(S) = {{d +l){d+ l)d, dd{d + 1)}, 

which are the reverses of V 6 {S), V 7 {S), and V${S) respectively. 

Proposition 2.15. If S is a maximum independent set ofB{d,3), then 
g'{S) is a maximum independent set of B{d + 2, 3). 

Proof. This proposition is proved analogously to Proposition 12.131 □ 

3. Action of the Symmetric Group 

In this section, we study the interaction between and the four 
functions we defined in the previous section. 

The symmetric group acts on the nodes of B{d,3) by a{xyz) = 
a{x)a{y)cr{z). This action preserves the graph structure, and therefore 
permutes the maximum independent sets. We will write A ~ B to 
mean A and B are two sets in the same orbit under the action of S^. 
Notice that the functions /, /', g, and g' are defined in a way such that 
if A ~ B, then f{A) ~ f{B) and so on. Therefore, we can speak of 
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each of these functions as taking an S^-orbit to an S^+i- or S^-orbit 
of MISs, accordingly. 

Proposition 3.1. Let S be a maximum independent set of B(d, 3). 
Let H, H' and H" be the stabilizers of S, f(S) and f'(S), respectively. 
Then 

H = H' = H", 

where we identify H with its image under the inclusion •— > S^+i- 

Proof. We know that H C H', and we must prove the other inclusion. 

Let a G H', and let a and possibly b be the loops of S. The set of 
loops must be preserved by a and moreover, by Proposition 12. 3^ a fixes 
each loop. We want to show that aid) = d. Suppose that a(d) = z ^ d 
and then a{w) = d, for some w ^ d. Since w is not a loop, the node 
awd then belongs to the set U2(S) from Definition 12. 7\ and so to f{S). 
That means that a(awd) = adz must be in f(S). Since it begins with 
a, and has d in the middle, it could only be in U±(S). But z ^ a,b, 
and so adz ^ U^S). Therefore, a(d) = d. 

Now, since o~(d) = d, a is also an element of S^. Furthermore, it 
must be in the stabilizer of S. Otherwise, it should map a node of S 
into a node having a d. Since this is not possible, a e H. 

The proof for H" is completely analogous. □ 

Proposition 3.2. Let S be a maximum independent set of B(d, 3). Let 
H, H' and H" be the stabilizers of S, g{S) andg'(S), respectively. Let 
r be the transposition interchanging d and d + 1 . Then 

H' = H"=(t,H), 

where, again, we identify H with its image in §d+2- Notice that r 
commutes with every element of H. 

Proof. As in the proof of Proposition 13. 1\ we know that (r, H) C H' . 
Now, let o E H' . Again, a must preserve the set of loops in g(S), and 
by Proposition I2.3[ a in fact fixes each loop. We will show that either 
cr or to fixes d and d+1. Let x, y, z and w be such that 

x h-^-*- d i— ^ y and z d + 1 1— ^ w. 

We know that x,y,z,w ^ a,b. Suppose that x ^ d,d + 1. Then we 
must have dxa G Vs(S) from Definition 12.121 Consider a(dxa) = yda. 
This node has to be in g(S), but it can only be in Vj(S). That means 
that y = d + 1. Likewise, considering 

a((d + l)za) = w(d + l)a, 
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we have w = d. So a(d) = d + 1 and a(d + 1) = d. This contradicts our 
assumption about x, and implies that x — d or x — d+1. Analogously, 
z = d+ l or z = d. That means that a fixes d and d + 1 or that it 
transposes them. Therefore, either cr or to is in if, and so o G (r, if). 
The other equality follows in much the same way. □ 

We now show the precise way in which our functions and interact. 

Lemma 3.3. Let S and S' be maximum independent sets of B(d, 3). 
Thenf(S)^f'(S'). 

Proof. For contradiction, suppose that there is a G S^+i such that 

f(S)=af'(S'). 

Let a and possibly b be the loops of S and a' = a" 1 (a) and b' = cr _1 (6) 
be the corresponding loops in 5". Let x ^ a', b', y ^ a, b be such that 



Suppose that y ^ d. Then the node ayd is in U 2 (S), and hence in 
f(S). Therefore, a^ 1 {ayd) must be in f'(S'). But a~ 1 (ayd) = a'dx, 
which cannot be in any of the sets that make up f'{S'). This implies 
that a(d) = d. In other words, a lies in the image of &d, and so 
af'(S') = f'(aS'). However, f(S) has at least one element of the form 
udd, and f'(aS') has none, so f(S) ^ f'(S'). □ 

A similar result holds for g and g'\ 

Lemma 3.4. Let S and S' be maximum independent sets of B(d,3). 
Theng(S)^g'(S'). 

Proof. This proof is similar to that of Lemma I3.3[ but somewhat more 
subtle. Suppose that there is a G Sd +2 such that g(S) = crg'(S'). Let 
a and possibly b be the loops of S and a' = cr _1 (a) and b' = o~~ l {b) be 
the corresponding loops of S' . Let x, z ^ a', b', y, w ^ a, b be such that 

x i— ^ d i — V and z d + 1 w. 

Suppose that y ^ d, d + 1. Then the node ayd is in V^S"), and therefore 
in g(S). That means that a~ 1 (ayd) = a'dx must be in g'(S'). But such 
a node does not belong to any of the sets that make up g'(S'). This 
implies that either a(d) = d or a(d) = d + 1. Analogously, we can 
prove that a(d + 1) = d + 1 or o~(d + 1) = d. 

Therefore, a transposes d and d + 1 or leaves them fixed. By Propo- 
sition [3l2j the transposition (d, d+ 1) is in the stabilizer of g'{S') and 
so by possibly multiplying o on the right by this transposition, we 
can assume that a fixes d and d + 1 and so it lies in S^. Therefore, 
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ag'(S') = g'(aS'), but g(S) has at least one node of the form udd, and 
g'{aS') has none, so g(S) g'(S'). □ 

We now state two invariants that completely characterize maximum 
independent sets in de Bruijn graphs with D = 3. This is useful to 
prove that our functions /, /' g, and g', together with the action of 
Sd, allow us to construct all maximum independent sets of B(d,3). In 
order to reverse these functions, we need the following proposition: 

Proposition 3.5. If S is a (possibly loop-less) maximum independent 
set of B(d, 3), with loops a and possibly b. Let d' be an integer such 
that a,b < d' < d. Then, 

s' = sn B(d',3) 

is a maximum independent set of B(d',3) with loops a and possibly b. 

Proof Since B(d', 3) is a subgraph of B(d, 3), S' is clearly an indepen- 
dent set. Furthermore, since S has one element from each cycle except 
possibly a cycle that only uses the digits a and b, then S' has the same 
property. Therefore, 5" has the cardinality of a maximum independent 
set. □ 

Proposition 3.6. Let S be a maximum independent set of B(d, 3) 
with I loops, where d is at least 3. There exists a digit x such that 
m x (S) — I + 1 if and only if there exist a E S d and S' a maximum 
independent set of B(d— 1,3) such that S = o~f(S') or S = af'(S'). 

Proof. One implication follows from the definitions of / and /', taking 
x = a(d — 1). 

Suppose now that there is such an x. We know it is not a loop by 
Proposition 12.31 We define the transposition a = (d — l,x) and the 
set S' = vS R B(d — 1,3), which is a maximum independent set of 
B(d — 1,3) by Proposition 13.51 

Let a and possibly b be the loops of S. We know that the node 
xax ^ S. Therefore, either xxa or axx must be in S. 

Suppose that axx G S. We are going to show that S = crf(S f ). To 
do so, we consider each of the sets that make up af(S'), and show that 
they are included in S. 

The nodes of aS' belong to S, because of the way we defined S'. 

Let us consider the nodes of aUi(S'). The nodes of this set are of 
the form xyx, xyy, yyx, xyz or yzx, for y, z ^ a, b, x. 

• The nodes of the form xyx are all in S. Otherwise, the hypoth- 
esis cannot be satisfied. 
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• If xyy G crUi(S'), then ayy G S'. This means that ayy G S, and 
so yyx cannot be in S. The node yxy cannot be in S either, 
since xyx is. So, xyy G S*. Analogously, if yyx G <rL r i(S"), then 
yyx G S*. 

• If xyz G cr£/i(S"), then ayz G S" and ay z G 5. Since neither 
zxy (adjacent to xyx) nor yzx (adjacent to ayz) can be in S, 
xyz must be in S. The same reasoning applies to yzx. 

Let us consider the nodes of aU2(S'). These have the form ayx. 
The nodes yxa (adjacent to xyx) and xay (adjacent to aya) cannot 
be in S, which implies that ayx G S. The same reasoning shows that 
aU 3 (S') C S. 

Now, take a node from aU±{S'). That is a node of the form uxv, 
with u, v loops. The nodes xuv (adjacent to uxu) and uvx (adjacent 
to vxv) cannot be in S. Therefore, uxv G S, and aU^(S') C S. 

Finally, we know that axx G S. The nodes xbx (adjacent to bxb) 
and xxb (adjacent to axx) cannot be in S. That implies that bxx G S, 
which means aU^S') C S. 

This proves that S D af(S'). By cardinality, we conclude that 
equality holds. 

If, instead of axx G S we have xxa G S, an analogous procedure 
shows that S = af'(S'). ' □ 

The following lemma is used in the proof of the next Proposition. 

Lemma 3.7. Let S be a (possibly loop-less) maximum independent set 
of B(d, 3), with d > 3. // there exist two different digits y and z, which 
are not loops, such that 

m y (S) = m z (S) =1 + 2, 

then yzy S and zyz S. 

Proof. Suppose that yzy G S. Then, by the assumptions on m y (S), 
there must be some v ^ y such that yvy (jz S. Suppose that vyy G 
S. The node zyz cannot be in S, and by the assumption on m z (S), 
zvz G S. Therefore, the nodes zvy (adjacent to vyy), vyz (adjacent 
to yzy) and yzv (adjacent to zvz) are not in S. But then the cycle 

zvy —> vyz —> yzv contributes no nodes to S, which contradicts the 
fact that S is maximum. If we assume that yyv G S, then the cycle 

yvz —> vzy —> zyv cannot contribute any node to S. 

In conclusion, our assumption that yzy is in S is inconsistent with 
S being a maximum independent set. By symmetry, the same holds if 
we assume zyz G S. □ 
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Proposition 3.8. Let S be a maximum independent set of B(d, 3), 
with d > 3. There are two different digits y and z such that 

rriy(S) = m z (S) = 1 + 2 

and no digit x such that m x (S) =1 + 1, if and only if there exist o G S^ 
and S' a maximum independent set of B(d — 2, 3) such that 

S = og{S') or S = ag'(S'). 

Proof. One implication follows from the construction of g and g' taking 
y = o~(d — 1) and z = o(d — 2). 

The proof in the other direction is analogous to the proof of Propo- 
sition 13.61 We can safely assume that y = d — 1 and z = d — 2. By 
Lemma [321 either (d - 2)(d - 2) and (d - 2)(d - l)(d - 1) are 

in S, or (d- l){d- l){d- 2) and (d - 2)(d - 2)(d - 1) are in S. In the 
former case, we find that there is an S' such that S = og(S'). In the 
latter case, we find that S = ag'(S'). □ 

Corollary 3.9. Let S and S' be maximum independent sets of B(d — 
1, 3) and B(d — 2, 3), d > 3. Then for T = f, f and Q = g, g' , we have 

F(s)^g(s'). 

Proof. This result follows from the invariants of ^(S) and G(S') that 
are stated in Propositions 13.61 and 13.81 □ 

This Corollary, together with Lemmas 13.31 and 13.41 shows that all four 
functions give rise to essentially different (i.e. in different S^-orbits) 
maximum independent sets. 

4. Characterization of Maximum Independent Sets 

In this section, we show that the functions /, /', g, and g f , together 
with the action of are sufficient to construct every maximum inde- 
pendent set of B(d, 3). For the rest of this section L will denote the 
set of loops of S, and I will denote \L\. In Section [51 we will need the 
case when S is a loop-less maximum independent set, i.e. L is empty. 

Lemma 4.1. Let S be a (possibly loop-less) maximum independent set 
of B(d, 3). There cannot be three different digits x, y, and z, with 
x,y, z ^ L, such that 

M x (S) = M y (S) = LU{x,y,z}, 
M Z (S) = L U {x, z) or LU {x, y, z). 
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Proof. Suppose that S is a maximum independent set and x, y, and z 
satisfy the given condition. Without loss of generality, we can assume 
that x, y, z, and the loops are less than 5. Then S' = S -8(5, 3) 
is a maximum independent set in 5(5,3) by Proposition 13.51 with 
M X (S') = M X (S), My(S') = My(S), and M Z (S') = M Z (S). We can 
check manually that there is no such independent set S'. Therefore, 
there is no such independent set S. □ 

Lemma 4.2. Let S be a (possibly loop-less) maximum independent set 
of B(d, 3). There cannot be three different digits x, y, and z, none of 
them loops, such that 

m x (S) = rriy(S) = m z (S) = 1 + 2. 

Proof. We prove the result by contradiction. Suppose there are such 
x, y and z. We know that L U {x} C M X (S) and \M X (S)\ = 1 + 2. 
Therefore, at least one of y and z must appear between x. An analogous 
statement holds for y and z. Without loss of generality, suppose that y 
appears between x. Then yxy (adjacent to xyx) is not in S, which forces 
z to appear between y. That, in turn, forces x to appear between z. 
That is, the nodes xyx, yzy and zxz are in S. But then, none of the 

nodes xyz — > yzx — > zxy are in S, which cannot hold. □ 

Remark 4.3. Let S be a maximum independent set of B(d, 3) with 
loops a and possibly b. There cannot be two different digits x and y 
such that m x (S) = m y (S) = 1 + 1. If there were, then xyx and yxy 
would have to be in S, leading to a contradiction. 

Proposition 4.4. Let S be a (possibly loop-less) maximum independent 
set of B(d,3), d > 3. Suppose there is no digit z such that m z (S) = 
l + l. Then, there must be exactly two digits x and y such that m x (S) = 
m y (S)=l + 2. Moreover, M X (S) = M y (S) =LU{x,y}. 

Proof. We just need to show that m x (S) = m y (S) = 1 + 2. Lemma I3TTI 
then implies that M. X (S) = 'My(S) = L U {x, y}. 

Without loss of generality, we can assume that md-i(S) < rrid-2(S) 
and 771^-2 (S 1 ) < mi{S) for all i < d — 2. We know that md-i(S) > 1 + 2 
and we want to prove that m d _ 2 (S) = 1 + 2. By Lemma W72~\ this would 
imply that d — 2 and d — 1 are the only digits with this property. 

We prove our result by induction on d. We manually check the 
result for all d < 5. Now, let d be greater than 5 and consider S' = 
S fl B(d — 1, 3). By the inductive hypothesis, we must have one of two 
possibilities: 

Case 1: S' has exactly one digit z with m z (S') = l + l. If z = d — 2, 
we are done. 
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Suppose that z ^ d — 2. By Remark 14.31 m d _ 2 (S') > m z (S'). On 
the other hand, we have m d _ 2 (S) < m z (S), and m d _ 2 (S') has to be at 
most m d _ 2 (S). Since m z (S') is at most m z (S), we must have m z (S') = 
m z (S) — 1 and m d _ 2 (S') = m d - 2 {S). This means that z(d — l)z ^ S 
and (d -2){d- l)(d - 2) G S. We then have that m d _ 2 (S) = I + 2, as 
we wanted to show. 

Case 2: S" has exactly two digits x and y with m x (S') = m y (S') = 
I + 2. We split this situation in two subcases. 

Case 2.1: We suppose x, y ^ d — 2. By an argument similar to that 
of Case 1, we know that 

M x (S) = M y (S) = L\J{x,y,d-l}, 

and that d — 1 ^ Md_2(5'). On the other hand, we know that 

rrid-^S) < m x (S) = 1 + 3 = m d _ 2 (S). 

By cardinality, one of x or y appears between d — 1 in S. Without 
loss of generality, suppose (d — l)x(d — 1) 6 S. None of the nodes 
x(d - 2)(d - 1), (d - 2)(d - l)x and (d - l)x(d - 2) can be in S, 
because they are adjacent to (d — 2){d — l)(d — 2), (d — l)x(d — 1) and 
x(d — 2)x, respectively. Thus, S cannot be a maximum independent 
set, a contradiction. 

Case 2.2: Either x or y equals d — 2. Suppose y = d — 2. Since 
md _ 2 (S') = / + 2, if (d - 2)(d - l)(d - 2) G 5, then m d _ 2 (S) = Z + 2, 
and the result follows. Therefore, we assume (d — 2)(d — l)(d — 2)^S. 
This forces x(d — l)x ^ S as well. 

We know that 

(2) M X (S) =M d - 2 (S) = LU{x,d-2,d- 1}. 

Since rrid-i(S) < m d ^ 2 (S) = 1 + 3, there can be at most two digits, 
besides the loops and itself, which do not appear between d — 1 in S. 
Call them u and v (potentially, u = v.) 

Case 2.2.1: Suppose u ^ v. We assume u,v ^ d — 2. That means 
that (d— l)(d — 2)(d— 1) G S 1 . Since u ^ v , we can assume without loss 
of generality that u ^ x. Then arux G 5 and (d — 2)u(d — 2) E S. The 
nodes (d — l)u(d — 2) and (d — 2)u(d — 1) must be in S, because the 
rest of the nodes in their cycles are adjacent to something just shown 
to be in S. We know that (d — l)u(d — 1) ^ S, because of the very 
definition of u. Plus, the nodes u(d — l)(d — 1) and (d — l)(d — \)u 
are adjacent two one of the two nodes we just mentioned being in 5*. 
Therefore, neither of them belong to S. That gives us a contradiction. 

Therefore, one of u, v must be d — 2, and so we have ([2]) and 

M d _i(£) = L U {u, d - 2, d - 1}. 
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We want to show that u = x. Assume the contrary. Then xux and 
{d — l)x(d — 1) are in S. Therefore, by inspecting their cycles we see 
that both xu(d — 1) and (d — l)ux must be in S. On the other hand, 
either u(d - l)(d - 1) G S or (d - l)(d - l)u G 5. 

However, tt(<i — — 1) G S 1 implies — 1) £ S, and (d — — 
l)w G 5 implies (d — l)ux S. Therefore, u — x. By Lemma 14.11 
applied to x, d — 1 and d — 2, this is a contradiction. 

Case 2.2.2 u = v. If we assume u/x and u ^ d — 2 and proceed 
as in the previous case, we get a contradiction. Therefore, w = x or 
u = d — 2. In either case, Lemma 14.11 applied to x, d — 1 and d — 2 
leads to a contradiction. □ 

We now state our main result. 

Theorem 4.5 (Characterization of the Maximum Independent 
Sets of B(d, 3)). For a// positive d we have: 

(1) ylny orfrzt of independent sets of B(d,3) under the action ofSd 
is obtained from the orbit of {000} under Si and the orbit of 
{000,010,111} under S 2 by a unique sequence of applications 
of f, f, g, and g'. 

(2) Let S be an MIS ofB(d, 3). Then the stabilizer of S is generated 
by disjoint transpositions. In particular, this implies that the 
size of the stabilizer of S is a power of 2. 

(3) Let bd,k be the number of orbits of MISs in B(d,3) whose ele- 
ments have stabilizers of size 2 k . Then we have the recurrence 
relation 




for d > 3, 



and the generating function 




d=l k=0 



(4) Let aa be the number of maximum independent sets of B(d, 3). 
Then ad satisfies 



a x = 1, 
a 2 = 6, 

ad = 2dad-\ + d{d — l)ctd-2 for d > 3, 
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and has exponential generating function 



E 

d=l 



a d t a 
d\ 



t + t 2 



l-2t-t 2 ' 



Proof. For d = 1, the only maximum independent set 5(1,3) consists 
of the unique node {000}. For the case of d = 2, it can be checked 
manually that the three orbits of maximum independent sets under $2 
are the orbits of {000,010,011}, {000,010,110}, and {000,010,111}. 
Note that the first two of these are /({000}) and /'({000}) respectively. 

Thus, the existence statement in ([1]) follows from Propositions 14.41 
13.61 and 13.81 The uniqueness comes from Lemmas 13.31 13.41 and Corol- 
lary EH 

The statements in ([2D and ([3]) follow from the previous result and 
the description of the stabilizers in Propositions 13.11 and 13.21 

Finally, the generating function in (T4J) is obtained by substituting 
s = 1/2 into the previous generating function, because 



ad 



E 

fc=0 



dlb djk 
2 k 



The recurrence follows immediately. 



□ 



Remark 4.6. The sequence is under A052608 in Sloane's Encyclo- 
pedia of Integer Sequences 



For illustrative purposes, we show the values of bdk, for all d < 6. 



k\d 


1 


2 


3 


4 


5 


6 





1 

(1,0) 


3 

(2,1) 


6 

(4,2) 


12 

(8,4) 


24 
(16,8) 


48 
(32,16) 


1 






2 

(2,0) 


10 

(8,2) 


32 
(24,8) 


88 
(64,24) 


2 










4 
(4,0) 


28 
(24,4) 



In each entry, the first number between parentheses indicates the 
number of orbits whose elements have only one loop, whereas the sec- 
ond number indicates the number of orbits whose elements have two 
loops. 



5. Loop-less Maximum Independent Sets 



In this section, we analyze the number of loop-less maximum inde- 
pendent sets (LMISs) of B(d, 3), for all d. Recall from the introduction 



MAXIMUM INDEPENDENT SETS OF DE BRUIJN GRAPHS 



17 



that the size of an LMIS of B(d, 3) is 

d 3 - d 

a*(d, 3) = — - — = a(d, 3) — 1. 

By MIS, we will continue to mean a maximum independent set with 
loops. 

Definition 5.1. Let S be a maximum independent set of B(d, 3), d > 3 

with loops a and possibly b. We define 

(3) 

S\{aaa} if S has only one loop, 

h(S) = ^ S\{aaa, bbb, aba} U {aab, bba} if S has two loops a < b, 
S\{aaa, bbb, aba} U {baa, abb} if S has two loops a > b. 

Proposition 5.2. Let S be a maximum independent set of B(d, 3). 
Then h{S) is an LMIS of B(d,3). 

Proof. Let S be an MIS of B(d, 3). If S has only one loop, then elimi- 
nating it leaves us with an independent set of the correct size. 

If S has two loops, then h(S) is a set of the correct size, since the 
nodes we added were not already present in S. However, we must see 
that h(S) is an independent set. Assume a < b. Suppose we have a 
node adjacent to aab. Then it is of the form abx or xaa. Since bxb and 
aaa are in S, then abx and xaa cannot be in S. Suppose now that we 
have a node adjacent to bba. Then it must be bax or xbb. Again, we 
know that aba and bbb are in S. Therefore, the nodes we add are not 
adjacent to any other nodes in the construction, and the result follows. 
The case a > b is proved analogously. □ 

Proposition 5.3. The function h is injective. 

Proof. Let S and S' be two different MISs of B(d, 3). Then showing 
that h(S) 7^ h(S') is just a matter of analyzing all the possible com- 
binations of loops and their relative order in S and S'. We leave the 
details to the reader. □ 

Lemma 5.4. Let S be a maximum independent set with two loops a 
and b. Let r be the transposition of a and b. Let S' = S\{aaa, bbb, aba}. 
Then S' = tS'. 

Proof. We must show that for every node w G S' such that a G w, we 
have w[a — > b] G S' and vice versa. 

Notice that any node of S' cannot contain a and b simultaneously. 
The nodes that contain two a's or two 6's are axa and bxb, and they 
are in S' for all x ^ a,b. Thus, xay £ S' for all x,y ^ a, so the nodes 
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that contain only one a are xya or axy for x,y ^ a. If xya G S', then 
bxy S', and so xyb must be in S 1 in order to have one element from 
its cycle. We can prove that axy G S' implies bxy G S' in a similar 
fashion. □ 

Proposition 5.5. The function h is surjective. 

Proof. Let S be an LMIS of B(d,3). By Proposition 14.41 we have two 
possibilities: 

If there is a digit x such that m x (S) = 1, then there is no node of 
the form xxy or yxx. Therefore, 5" = 5* U {xxx} is an MIS of B(d, 3), 
and S = h(S'). 

On the other hand, if there are two digits x and y such that m x (S) = 
rriy(S) = 2, then we have either xxy,yyx G S or yxx,xyy G S. In the 
first case, we construct 

S' = S U {xxa;, y?/?/, a;?/x}\{a;a;|/, 

If x < y, then S = h(S'). If x > y, then S = h(rS'), where r is the 
transposition of x and y. The remaining case is dealt with analogously. 

□ 

Theorem 5.6. Let a* d be the number of loop-less maximum independent 
sets of B(d,3) . Then a* d = ad- 

Proof. Propositions 15.31 and 15.51 show that there is a bijection between 
the set of MISs and LMISs of B{d, 3). □ 
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